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Abstract 

We define the notion of local property reconstruction for parametrized graph properties, and 
show that there are local reconstructors for the properties of connectivity, strong connectivity 
in directed graphs, fc-cdgc-connectivity, and small diameter in undirected graphs. Additionally, 
we obtain tolerant property testers for these properties by using our local reconstructors in 
conjunction with the corresponding property testers. 
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1 Introduction 



We define the notion of local reconstruction for parametrized graph properties, which extends the 
notion of local property restoring, found in [5]. Suppose we are given a very large graph G that is 
promised to be close to having a property V s (e.g. Vd might denote the property of having diameter 
at most D). Local reconstruction algorithms provide very fast query access to a "corrected" version 
of G. That is, the local reconstruction algorithm should have in mind some G which has the property 
V s and is also close to the original graph G. The goal of the local reconstruction algorithm is to 
provide very fast query access to the edges of G — that is, given a pair of vertices u, v in G, the 
algorithm should in sublinear time determine whether the edge (u, v) is in G. It can be useful 
to relax the condition that G has property V s and only require that G has property Vm s \ which 
contains V s but is possibly larger. For instance, if Vd is the property of having diameter at most 
D, we might only require G to have property V&d, i.e. having diameter at most 4D. We call such 
an algorithm a local reconstructor for V s . 

In this paper we study local reconstruction algorithms for some of the most basic problems in 
graph theory, namely connectivity in undirected graphs, strong connectivity in directed graphs, k- 
connectivity in undirected graphs, and small diameter in undirected graphs. Such algorithms might 
be used to efficiently repair connectivity or small diameter in graphs. These are common issues 
that have been considered in various models in wireless networks and robotics (see for example 

En eii). 

Our results. Our specific results are the following. 

• We present a general way of transforming a local reconstructor for a parametrized property 
and a property tester for the same property into a tolerant tester. Such a result was first 
proven in [5] for the special case of non-parametrized properties. 

• We give local reconstructors for the following properties: connectivity in undirected graphs; 
strong connectivity in directed graphs; ^-connectivity in undirected graphs; having diameter 
at most D, for some diameter parameter D, in undirected graphs. 

• We give tolerant testers for the above properties. 

A property tester for a property V takes as input a graph G and parameter e, and accepts with 
high probability if G has V and rejects with high probability if G is e-far from having V . Similarly, 
a tolerant tester gets a graph G and parameters ei < 62, and accepts with high probability if G is 
ei-close to having V and rejects with high probability if G is e2-far from having V . 

Brakerski's idea (see [5]) behind the construction of the tolerant tester is to attempt to use the 
local reconstructor on G to get G which has the property and is close to G. If G is indeed close 
to having the property, then the attempt will be successful. If G is far from having the property, 
then either G will be far from having the property (which can be detected by running the property 
tester on G) or G will be far from G (which can be detected via sampling). 

For each of the above properties, the strategy for constructing a local reconstructor is the same. 
First, we find some graph parameter that must be bounded if the graph is close to a property (e.g. a 
graph that is close to being connected cannot have too many connected components). Then, we find 
a systematic way of adding edges to the graph to guarantee that the reconstructed graph has the 
desired property in such a way that the number of edges added depends on the parameter we bound 
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(e.g. designate a "super-node" to which we add edges; then add just a few edges per connected 
component, by exploring the connected component and electing leaders). We use this relationship 
to guarantee that the resulting graph is still close to the original graph. Since previous works ([8], 
[3], [32], P2]) give property testers for these properties we study, we obtain, as corollaries, that 
these properties have tolerant testers. 

Related work. The notion of locally reconstructing a data set was introduced under the name 
local filter in pQ, where the property considered was monotonicity of sequences. A model of lo- 
cal filters in which the requirements are strengthened is presented in [20], where the property of 
monotonicity is again considered. A closely related work is [5], which introduces the concept of 
local reconstruction under the name local restoring and also shows a special case of our relationship 
between local reconstructors, property testers, and tolerant testers. Several other properties of 
graphs, functions, and geometric point sets have been studied in the context of local reconstruction. 
A local reconstruction algorithm for expander graphs is given in |10j . [5] gives local reconstructors 
for the properties of bipartiteness and p-clique in the dense graph model, as well as monotonicity. A 
more recent work ([9]) concerns locally reconstructing Lipschitz functions. On the geometric side, 
[6] studies local reconstruction of convexity in two and three dimensions. The problem of testing 
the properties for which we give local reconstructors has been studied in multiple works — [8] for 
connectivity, [3] for strong connectivity, [8] and [12] for fe-connectivity, and [17] for diameter. 

Organization. In SectionEl we formally define our model and the notions of local reconstructors, 
property testers, and tolerant testers, and in Section [3] we formally state our main results. In 
Section [4] and El we present our results for connectivity and strong connectivity. Section [6] serves 
as a brief interlude where we show how our local reconstructor for connectivity can be modified to 
give a neighbor oracle for the corrected graph G' . This procedure will then be used in Sections [7] 
and [H in which we present our results for /c-connectivity and small diameter respectively. 

2 Preliminaries 

We adopt the general sparse model of graphs as presented in [17], i.e. the graph is given as an 
adjacency list, and a query for a vertex v is either its degree, or an index i on which the i-th 
neighbor of v is returned (with respect to the representation of the neighbors as an ordered list). 
We assume there is some upper bound m on the number of edges of the graphs we work with, 
and distances are measured according to this, i.e. if k is the minimum number of edge deletions 
and insertions necessary to change one graph to the other, then their distance is k/m. We assume 
m = Q(n) where n is the number of vertices in the graph. 

Definition 2.1 ([17]). The distance between two graphs G±,G2, denoted dist(Gi, G2), is equal to 
the number of unordered pairs (u, v ) such that (u, v) is an edge in one graph but not in the other, 
divided by m. A property is a subset of graphs. Throughout this paper we say that a graph has 
property V if it is contained in the subset V . The distance between a graph G and a property 
V, denoted dist(G,P), is equal to dist(G,V) = min G / gP dist(G, G'). If dist(G,P) < e, then G is 
e-close to V, otherwise G is e-far from V. 
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Parametrized properties. Our result relating local reconstructors to tolerant testers (Theo- 
rem [37T]) generalizes a result of Brakerski ([5]) to parametrized properties. A parametrized property 
V s is a property belonging to a family {V s } s of properties parametrized by some parameter s. 
For example, the property Vd of having diameter at most D is a parametrized property, with the 
diameter D as the parameter. 

2.1 Local reconstructors 

Definition 2.2. For an undirected graph G = (V, E), the neighbor set of v G V is the set Ng(v) = 
{u G V | (u,v) G .E}. For a directed graph G = (V,E), the in-neighbor set and out-neighbor set of 
v G V are respectively iV£?(v) = G F | G E} and N G ut (v) = {u £ V \ (v,u) £ E} 

Definition 2.3 (Neighbor and edge oracles). A neighbor oracle for a graph G is an algorithm 
which, given query v G V and either query deg or i, returns deg(f) or the i-th neighbor of v 
(with respect to some fixed ordering of the neighbor set) in O(l) time. An edge oracle for G is an 
algorithm Eq which returns in 0(1) time Eq{u,v) = 1 if (u, v) G E and Eq(u,v) = otherwise. 

One can use a neighbor oracle to implement an edge oracle with query complexity deg(u), since 
given a query pair (u, v), one can check if u G Ng(v) with deg(v) queries to the neighbor oracle. We 
now formally define local reconstructors. Roughly speaking, a local reconstructor uses a neighbor 
oracle for G, which is close to V, to implement an edge oracle for G G V which is close to G. 

Definition 2.4. Let ei,€2, 5 > and let 4> : N — > N. An (ei, £2, 5, (f){-))-local reconstructor (LR) 1Z 
for a parametrized graph property V s is a randomized algorithm with access to a neighbor oracle 
of a graph G that is ei-close to V s , which satisfies the following: 

• 1Z makes o(m) queries to the neighbor oracle for G per query to 1Z 

• There exists G G with dist(G, G) < €2 such that 1Z is an edge oracle for G, with 
probability at least 1 — 5 (over the coin tosses of TZ) 

An (ei, 62, 5) -local reconstructor for a non-parametrized graph property V is simply a (ei, €2, 6, </>(■))- 
local reconstructor where V is viewed as the only property in its parametrized family and <j> is the 
identity function. The query complexity of the local reconstructor is the number of queries 7Z makes 
to the neighbor oracle for G on any query (u,v). We note that this definition differs from that 
of [9] because even if G G V, the reconstructed graph G may not equal G in general. 

2.2 Tolerant testers 

Definition 2.5. Let €±,€2 > and let (/) '■ N — » N. An (ei, 62, -tolerant tester T for a 
parametrized graph property V s is a randomized algorithm with query access to a neighbor or- 
acle of an input graph G that satisfies the following: 

• T makes o(m) queries to the neighbor oracle for G 

• If G is ei-close to V s , then Pr[7~ accepts ] > | 

• If G is e2-far from V^^, then Pr[T accepts ] < 3 
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For a non-parametrized graph property V, an (ei, £2) -tolerant tester is defined similarly by viewing 

V as the single member of its parametrized family and taking <p to be the identity function. 

For a parametrized graph property, an (e, (j>(-)) -property tester is simply a (0, e, 0(-))-tolerant 
tester, and for a non-parametrized graph property, an e-property tester is defined analogously. 

3 Local reconstructors and tolerant testers 

We now show that our notion of local reconstructors can be used alongside property testers to 
construct tolerant testers for properties of sparse graphs. This idea is not new and can be found 
as [5J Theorem 3.1], but we extend the result for parametrized properties. 

Theorem 3.1. Let V s be a parametrized graph property with an (ei, €2, 5, (p(-))-local reconstructor 1Z 
with query complexity qn and suppose Vm s \ has a (e' , ip (•)) -property tester T with query complexity 
q-f. Then for all (3 > 0, V s has an (ei, £2 + e' + (3, (ip o </>)(•)) -tolerant tester with query complexity 
0{{l/(3 2 + q T )q n ). 

Proof. The algorithm and proof follows that of [5j Theorem 3.1]. 
The tolerant tester for V s is as follows: 

1. Run 1Z on G and estimate dist(G, G) to within additive error of /3/2 by sampling (u,v) € 
V x V. 

2. If estimate of dist(G, G) exceeds €2 + /3/2, reject. 

3. Run T on G using 7Z, and accept if and only if T accepts. 

If G is ei-close to V s , then with high probability dist(G, G) < ei- Therefore, the algorithm passes 
step 2 and with high probability T accepts G, since G € V^ s y If G is (e2 + e' + /3)-far from V^uu)), 
then either dist(G, G) > 62 + /3/2, in which case step 2 fails with (constant) high probability, or 
dist(G, G) < £2+/3 in which case G is e'-far from Vm^s)) and so T rejects with high probability. □ 

Taking cj) and tp to be identity, one gets as a special case the result of [5j Theorem 3.1] that if 

V is a graph property with an (ei, €2, <5)-local reconstructor and an e'-property tester, then for all 
j3 > it has an (ei, €2 + e' + /3)-tolerant tester. 

In this work, we give local reconstructors for several graph properties: connectivity in undirected 
graphs, strong connectivity in directed graphs, and small diameter in undirected graphs. To be 
precise, we prove the following in Sections HI [U [3 and [8] respectively. 

Theorem 3.2. There is an (e, (1 + a)e,5)-LR for connectivity with query complexity O (j^) • 

Theorem 3.3. There is an (e, (4+a)e, 5)-LR for strong connectivity with query complexity O (3—) • 

Theorem 3.4. There is an (e, (2 + a)e+c/c/2, k{5+^f))-LR for k- connectivity with query complexity 
°((( 1 c+ l ) k) k t 3k (t + k) k log k (t + k)log k (Cn)) forn>\, where C = ln(1/( 1 1 _ 7 ) ) andt= 1 ^. 

Theorem 3.5. There is an (e, (3 + ot)e + — + c, S + -^,<f)(s) = 2s + 2)-LR for diameter at most D 
with query complexity 0(^j^A°( Alog A ) log n) where A = (d/e) 0<yl l^ and d = 2m/n is the bound 
on average degree. 
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Combining Theorem 13.11 with each of Theorems 13.21 13.31 13.41 and I3.5[ along with property 
testers for each of the four properties (see [8], [Bj, [12], and pZ]), we immediately obtain the 
following tolerant testers. 

Corollary 3.6. For all a, f3, e > 0, there is an (e, (1 + a)e + fi) -tolerant tester for connectivity. 

Corollary 3.7. For all a, f3,e > 0, there is an (e, (4+ a)e+ /?) -tolerant tester for strong connectivity. 

Corollary 3.8. For all a,/3,c,e > 0, there is an (e,(2k + a)e + ck/2 + /3) -tolerant tester for 
k- connectivity. 

Corollary 3.9. For all D, a, j3, e > and constant c < 1, there is an (e, (3+a)e+ ^ +c+/3, 4D + 6)- 
tolerant tester for diameter at most D, for n > ~. 

4 Local reconstruction of connectivity 

In this section we prove Theorem l3.2[ We begin by giving the high level description of the algorithm 
and then we present the implementation and analysis of the algorithm. 

4.1 High level description 

The basic idea behind the algorithm is as follows. We designate a "super-node" vq and add edges 
from a few special vertices to Vq so that the resulting graph is connected. Ideally, we have exactly 
one special vertex in each connected component, since this number of edges is both necessary and 
sufficient to make the graph connected. Therefore, we reduce the problem to defining a notion of 
"special" that can be determined quickly, that ensures that at least one node per component is 
special and that likely not too many extra nodes per component are special. How does a given vertex 
know whether it is special? Our algorithm tosses coins to randomly assign a rank r(v) G (0, 1] to 
each v G V{G). Then v can explore its connected component by performing a breadth-first search 
(BFS) and if v happens to have the lowest rank among all vertices encountered, then v is special. 
The only problem with this approach is that if v lies in a large connected component, then the 
algorithm makes too many queries to G to determine whether v is special. We fix this by limiting 
the BFS to K vertices, where K is a constant depending only on a few parameters, such as success 
probability and closeness. We then show that components larger than size K do not contribute 
many more special vertices. 

4.2 Algorithm 

We now give the algorithm. We first do some preprocessing. In particular, we arbitrarily fix some 
vertex vq G V(G). Additionally, we have a random oracle which, for each vertex v G V(G), it 
assigns a random number r(v) G (0, 1]. 



procedure Connected (v\, V2) 




if Eg{v\,V2) = 1 then 


> if edge is already in graph, it stays in the graph 


return 1 




else 
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if vq $l \v\1V2\ then > do not add edge if neither endpoint is vq 

return 
else 

Let v £ {vi,v 2 } \ {v } 

BFS from v up to K vertices and let U be the set of vertices visited 

if r(v) < r(u) for all u € U then > check if v is special 

return 1 
else 

return 
end if 
end if 
end if 
end procedure 



The following lemma shows that the procedure does not add too many extra edges. To prove 
it, we use the fact that if G has at least em + 2 connected components, then G is e-far from being 
connected. 

Lemma 4.1. With probability at least 1 — 5, Connected adds at most n/5K + em + 1 edges. 

Proof. Let X be the number of edges added by the local reconstructor. Call a connected component 
C small if |C| < K and large otherwise. Let Y be the number of edges contributed by large 
components. Each small component contributes exactly one edge, hence X = i + Y where i is 
the number of small components. Since G is e-close to being connected, i < em + 1. Moreover, 
each vertex in a large component contributes an edge with probability at most 1/K. By Markov's 
inequality, 



Pr 



hence 



Pr 



Y > 



11 



n 

Ik 



< Pr 



X > -— + em + 1 
OK 



Y > 



< Pr 



E[Y] 



Y > 



< 5. 



n 

6K 



< 6. 



□ 



Proof of Theorem \3.2\ . Let V be the family of connected graphs on n vertices. The local recon- 
structor 1Z will run Connected with K = s = O (-J-). Clearly 1Z has query complexity 

O(K) = 0(g^A. To see that G is connected, observe that, on each connected component, the 
rank function attains a minimum on some vertex, and that vertex therefore must get an edge to vo. 
Furthermore, if G is e-close to V, then by Lemma 14. 1 1 with probability at least 1 — 5 the procedure 
will add no more than (1 + a)em edges and hence dist(G, C?)<(1 + a)e. □ 



5 Local reconstruction of strong connectivity 

In this section we prove Theorem 13.31 We first go over some preliminary definitions and properties 
of directed graphs. Then we give the high level description of the algorithm. Finally we end by 
presenting the implementation and analysis of the algorithm. 
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5.1 Preliminaries 



Throughout this section, we will use arc to mean directed edge. 

Definition 5.1. A directed graph G is connected if it is connected when viewing arcs as undirected 
edges, and it is strongly connected if there is a path between every ordered pair of vertices. A con- 
nected component of G is a maximal connected subgraph of G, and a strongly connected component 
is a maximal strongly connected subgraph of G. 

Definition 5.2. A vertex is a source (sink) if it has no incoming (outgoing) arcs. A strongly 
connected component with no incoming (outgoing) arcs is a source (sink) component. 

Query model. In our model, we assume our neighbor oracle has access to both the in-neighbor 
set and the out-neighbor set. This allows us to perform both backward and forward depth-first 
search (DFS), as well as undirected BFS, which is a BFS ignoring directions of edges. 

5.2 High level description 

The basic idea behind the algorithm is as follows. As in the undirected connectivity case, we 
designate a "super-node" vq, but now we add arcs from a few special "transmitting" vertices to 
vq and also add arcs from vq to a few special "receiving" vertices. In order to make G strongly 
connected, we need to add at least one arc from the super-node to each source component and from 
each sink component to the super-node without adding too many extra arcs. A naive approach is 
to emulate the strategy for connectivity: to decide if v is a transmitter, do a forward DFS from 
v and check if v has minimal rank (and analogously for receivers and backward DFS). Again, we 
can limit the search so that large components may have some extra special vertices. The problem 
with this approach is that a sink component could be extremely small (e.g. one vertex) with many 
vertices whose only outgoing arcs lead to the sink. In this case, all of these vertices would be special 
and receive an edge to vq. Therefore we tweak our algorithm so that if v does a forward DFS and 
sees few vertices, then it checks if it is actually in a sink component. If so, then it is a transmitter; 
if not, then we do a limited undirected BFS from v and check minimality of rank. We then show 
that we do not add too many extra edges this way. 

5.3 Algorithm 

We now present the algorithm. We do the same preprocessing as in Connected, i.e. we arbitrarily 
fix a vertex vq € V(G) and have access to a random oracle that randomly assigns ranks r(v) E (0, 1] 
to each v € V(G). 



procedure StronglyConnected(«i, V2) 

if Eg(vi,V2) = 1 then > if edge is already in graph, it stays in the graph 





return 1 



else 



if Vq $l {vi,V2} then 

return 
else if V2 = vq then 



> do not add arc if neither endpoint is vq 



> check if v\ is a transmitter 
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Forward DFS from v\ up to K vertices 




if Forward DFS sees at least K vertices or InSmallSink(ui) then 


return 1 if v\ has lowest rank among 


the DFS vertices else return 


else 




Undirected BFS up to K vertices 




return 1 if v\ has lowest rank among 


the BFS vertices else return 


end if 




else if v\ = Vq then 


> check if wi is a receiver 


Backward DFS from v% up to K vertices 




if Backward DFS sees at least K vertices 


or InSmallSource^) then 


y»0"t~ iiy'vi 1 lTUn nat; 1 /"rtsrpci'i" ranV am oner 
IctUlll ± 11 U2 HCIO 1UWL.BI dillUllbi 


t hp I iT-T^ \Tf*vt"ir , (*fz c o t"*o^~ i i l" Ti 

Lilt 1—f -L kJ VCIIjIIjCQ) ClOt 1CIU1 11 \J 


else 




Undirected BFS up to K vertices 




return 1 if V2 has lowest rank among 


the BFS vertices, else return 


end if 




end if 




end if 




end procedure 





The procedure uses two subroutines: InSmallSource(u) and InSmallSink(z; ), which return 
True if v is in a source (respectively sink) component of size less than K. We will implement 
InSmallSink (InSmallSource is similar except reverse all the directions of edges) by running 
Tarjan's algorithm for finding strongly connected components, except stopping after only exploring 
all nodes reachable from v. If only one strongly connected component is returned, then return True, 
otherwise return False. This clearly runs in 0{K) time since Tarjan's algorithm runs in linear time 
and we are simply restricting the algorithm to the subgraph induced by all strongly connected 
components reachable from v. 

The following three lemmas capture the fact that if a directed graph is almost strongly con- 
nected, then it cannot have too many source, sink, or connected components, and therefore our 
algorithm likely does not add too many arcs. 

Lemma 5.3. If G is e-close to being strongly connected, then G has at most em source components 
and at most em sink components. 

Proof. Consider the directed graph G of strongly connected components of G. To make G strongly 
connected, G must have no sources or sinks, yet adding an arc eliminates at most one source and 
at most one sink from G. □ 

Lemma 5.4. If G is e-close to being strongly connected, then G has at most em + 1 connected 
components. 

Proof. If G is e-close to being strongly connected, then it is also e-close to being connected. □ 
Lemma 5.5. With probability at least 1 — 5, the procedure will add no more than + 4en + 2 arcs. 
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Proof. The analysis is similar to that of Lemma 15.51 except slightly more complicated. Let X be 
the random variable equal to the number of arcs added. Let S be the random variable equal to 
the number of arcs added that end at vq, and let T be the random variable equal to the number 
of arcs added that start at vq. We will focus on S, since the analysis for T is symmetrical. Any 
vertex v for which In Small Sink (v) is True lies in a sink component of size less than K; call these 
components small sink components. Also, any vertex that does an undirected BFS and sees less 
than K vertices must belong to a connected component of size less than K\ call these components 
small connected components. Let S' be the number of arcs counted by S contributed by vertices not 
in small sink components or small connected components. Then S < S' + 2em + 1 since each small 
sink component (of which there are at most em by Corollary 15. 3ft contributes at most 1 outgoing arc 
and each small connected component (of which there are at most em + 1 by Lemma f5.4p contributes 
at most 1 outgoing arc. Define T' analogously to S' , except for source components instead of sink 
components, so that T <T' + 2em + 1. Then we have E[S'} < and E[T'] < and therefore 



Pr 



In 

X > — + Aem + 2 
OK 



< Pr 

< Pr 

< 8 



S ' + T '>M 



S' + T' > 



E[S' + T'] 
5 



where the final inequality follows from Markov's inequality. □ 

Proof of Theorem EOI The local reconstructor 1Z runs StronglyConnected with K = Sat ™/ 2 -i • 

To see that G is strongly connected, note that every source component has some vertex of minimal 
rank, and this vertex will have the lowest rank among its backward ^^-neighborhood, hence every 
source component gets an arc from Do- By a similar argument, every sink component gets an arc to 
vq. Now we must show that with high probability we did not add too many edges. By Lemma [531 
with probability at least 1 — 5 we added at most (4+a)em edges and hence dist(G, G) < (4+a)e. □ 



6 Implementing a neighbor oracle with connectivity reconstructor 

Our local reconstructors for ^-connectivity and small diameter rely on the given graph G being 
connected. Even if G is not connected, it is close to being connected, and so one may hope to first 
make G into an intermediate connected graph G' using a local reconstructor for connectivity, and 
then run the local reconstructor for the desired property on G' to obtain G. We would therefore 
like a neighbor oracle for G' , but the local reconstructor only gives us an edge oracle for G' . We 
show how to modify the local reconstructor for connectivity to obtain a neighbor oracle for G' , with 
only a slight loss in the parameters achieved. 

As is, our connectivity reconstructor Connected from Section O is almost a neighbor oracle. 
Recall that the reconstructor works by selecting an arbitrary vq € G and adding edges from a 
few special vertices to vo — no other edges are added. For a vertex v ^ vo, Ng(v) C Nq>(v) C 
Ng(v) U {vq}, thus Nqi(v) can be computed in constant time. However, the problem arises when 
one queries Nc(vo). Potentially 0(n) edges are added to vq by the reconstructor, so computing 
Nc(v) queries Connected 0(n) times. We thus modify Connected to obtain the following. 
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Theorem 6.1. Fix a positive constant c < 1. There is a randomized algorithm N, given access to 
the neighbor oracle Nq for G that is e-close to being connected such that, with probability at least 
1 — 5, there exists a connected graph G' that is ((1 + a)e + c)-close to G and N is a neighbor oracle 
for G' , with query complexity O (33^). 

Proof. We modify Connected as follows. Instead of designating one super-node vq, we designate 
c • n super-nodes. Partition V into sets of size 1/c and assign each set in the partition to a distinct 
super- node. This can be implemented, for instance, by identifying V = {1, . . . , n}, designating the 
super-nodes to be {1, ... , cn}, and for a given vertex v E V, assign v to the super-node h(v) = \v/c\. 
For any v that would be connected to Vq, we instead connect it to h(v). Additionally, we add the 
edges (i,i+l) for alii € {1, . . . , cn— 1} to ensure connectivity. This adds a total of c— 1 edges, which 
constitute at most c-fraction of the edges. Call this modified local reconstructor Mod-Connected. 
It is straightforward to see that Mod-Connected has the same query complexity as Connected. 
We now implement an algorithm to compute Nqi as follows. Given a non-super-node v, its neighbor 
set could have grown by at most adding h(v), so Nq/(v) can be computed with O(l) calls to 
Mod-Connected. For a super node w, its neighbor set could have grown by at most 1/c + 2, since 
at most 1/c non-super-nodes could have been connected to w, and w is further connected to at most 
two super-nodes. Therefore Ng>(w) can be computed with 0(l/c) calls to Mod-Connected. □ 

7 Local reconstruction of ^-connectivity 

In this section we prove Theorem 13.41 We first go over some preliminary definitions and concepts 
related to ^-connectivity. Then we give the high level description of the algorithm, and finally we 
end by presenting the implementation of the algorithm. All graphs in this section are undirected 
unless otherwise specified. Throughout this section, we assume k > 1. 



7.1 Preliminaries 

For a subset U C V of the vertices in a graph G = 
to deg(U) = \{(u,v) eE\ueU,v E V \ U}\. 



(V,E), the degree ofU, denoted deg(£7), is equal 



Definition 7.1 (^-connectivity). An undirected graph G = (V,E) is k-connected if for every 
U C V, deg(U) > k. 

An equivalent definition of ^-connectivity, a result of Menger's theorem (see |13j). is that every 
pair of vertices has at least k edge-disjoint paths connecting them. An important notion in the 
context of fc-connectivity is that of an extreme set. Extreme sets are a generalization of connected 
components to the ^-connectivity setting. Connected components are 0-extreme sets. 

Definition 7.2. A set U C V is ^-extreme if deg(t/) = I and deg(M / ) > t for every W C U. 

It is straightforward from the definition that if a graph is (k — l)-connected and has no (k — 1)- 
extreme sets, then it is in fact /c-connected. Extreme sets satisfy some nice properties, which 
are used by [12] as well for property testing and distance approximation for /c-connectivity. Two 
extreme sets are either disjoint or one is contained in the other (see [15]. If W C U and W is 
^VK-extreme and U is £[/-extreme, then £\y > Ijj. Consequently, distinct ^-extreme sets are disjoint. 

One may hope there is some relationship between distance from ^-connectivity and the number 
of extreme sets, analogous to the relationship between distance from connectivity and the number 
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connected components. Indeed, for a (k — l)-connected graph, there is. A graph G that is (k — 1)- 
connected cannot have any ^-extreme sets for i < k — 1. Moreover, the number of additional edges 
required to make G £>connected is at least half the number of (k — l)-extreme sets in G. This is 
simply because each (k — l)-extreme set requires at least one additional edge, and adding an edge 
to G meets the demand of at most two such sets. 

7.2 High level description 

The idea behind the algorithm is to simply iteratively make the graph j-connected, for j = 
1,2, ...,k. Let Gj be the corrected j-connected graph obtained from G. It suffices to use a 
neighbor oracle for Gk-i to implement a neighbor oracle for Gk- The base case k = 1 is addressed 
by Section [6l 

Now suppose we have a neighbor oracle for (A; — l)-connectivity and we wish to implement a 
neighbor oracle for ^-connectivity. Again, we use a similar idea as in Sections H] and [6l Specifically, 
we fix a positive constant k/n < c < 1 and designate a set Vq C V of c ■ n > k super- nodes, 
connecting them in a certain way to make the subgraph induced by Vq fc-connected (details in the 
next subsection). The idea is then to ensure at least one vertex from each extreme set contributes 
a new edge to a super-node. Again, we implement this by assigning all vertices a random rank 
independently and uniformly in [0, 1) and searching a neighborhood of v up to t vertices, where t 
is appropriately chosen, and checking if it has minimal rank. Instead of doing this search via BFS, 
we use the extreme set search algorithm of [8]. The basic procedure satisfies the following: if v lies 
in a i-bounded extreme set, it finds this set with probability 0(t -2 ), otherwise it never succeeds. 
We iterate the basic procedure a polylogarithmic number of times. If every iteration fails (which 
happens if v does not lie in a i-bounded extreme set) then we tell v to connect to a super-node 
with probability ^ lp g OV* ) ^ _ \y e then show that with high probability the resulting graph Gk is 
fc-connected and that we do not add too many edges. This completes the edge oracle. We will 
also show how implement these ideas carefully so that the edge oracle can be transformed into a 
neighbor oracle in order to make the recursion work. 

7.3 Algorithm 

Given our previous discussion, all that remains to implement the algorithm is to implement the 
following tasks: 

• Search: Given v E V which lies in an extreme set S, find a neighborhood U C S containing 
v such that \U\ < t. 

• Decision: Given v & V, determine whether v should contribute an edge to Vo, and if so, to 
which v' gVo. 

7.3.1 Implementing the search task 

The goal of the search task is to detect that v lies in an extreme set of size at most t. We are now 
assuming the input graph is (k — l)-connected, so all extreme sets are (k — l)-extreme. The search 
task can be implemented by a method of [8] and |12j which runs in time 0(i 3 <ilog(i<i)) where d is 
a degree bound on the graph. Roughly, the procedure works by growing a set U' starting with {v} 
and iteratively choosing a cut edge and adding the vertex on the other end of the edge into U'. The 
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cut edge is chosen by assigning random weights to the edges, and choosing the edge with minimal 
weight. The procedure stops when the cut size is less than k or when \U'\ = t. The pseudocode 
for the extreme set search is as follows. 



procedure ExtremeSetSearch(?;, k 


1 \ 
- 1) 




For each edge, independently assign 


a random wei; 


dit uniformly from [0, 1) 


U> <- {„} 






repeat 






(u, w) <— arg min{wt(«i, U2) u\ 


G U',u 2 i U', 


[ui, 112) G E(Gk-\)} > Implicitly uses 


(k - l)-CONN 






U'<-U'\J {w} 






until \U'\=toT deg(U') < k 






end procedure 







Its running time is 0(td log (td)) but its success probability is only Q(t~ 2 ) (that is, the probability 
that U' = S when the procedure terminates), so the basic procedure is repeated Q(t 2 ) times or until 
success. We can check if each run is successful by checking that the final set U' is a (k — l)-extreme 
set. This procedure is adapted in [T7] to the general sparse model by noticing that no vertex of 
degree at least t + k would ever be added to 5, and hence the procedure has a running time of 
0(t 5 (t + k) log(i + k)). We actually want the probability that all vertices have a successful search 
to be at least 1 — 7/2, so we repeat the basic procedure 0(t 2 log(Cn)) times for C = wx /(i- 7 /2)) ' 
yielding a time complexity of 0(t 3 (t + k) log(i + k) log(Cre)). 



7.3.2 Implementing the decision task 

For the decision task, it is helpful to first think about how to do it globally. First, we must hash 
each v G V to a set h(v) of k super- nodes in Vo- This can be implemented as follows. Label 
V = {1, . . . , n} and Vo = {1, . . . , cn}, and define h(v) = { \v/c\ , \v/c] + 1, . . . , \v/c\ + (k — 1)} 
where the entries are taken modulo cn. The specific way we do this hashing is not important — it 
suffices to guarantee the following properties: (1) for every v G V, \h(v)\ > k and (2) for every 
super-node v' G Vo, there are at most a constant number, independent of n, of v such that v' G h(v), 
and that these v are easily computable given v'\ our method guarantees the constant ^, which is the 
best one can hope for given Property 1. Property 1 will be used later to ensure that the resulting 
graph is fc-connected (Lemma 17. 4p . Property 2 ensures, by the same reasoning as in Section [U that 
computing Nc h makes at most O Q + k) calls to Ec k . 

Now, to decide whether v should be connected to a super-node, do an extreme set search to 
find a neighborhood U, of size at most t, containing v, and check if v has minimal rank in U, 
where the rank is the randomly assigned rank given in the high level description. If so, then mark 
v as successful. If the extreme set search fails, i.e. all 0(t 2 logn) iterations fail, then mark v as 
successful with probability M*ZpZ*i where C = ^rnn^JW) a S a i n > which can be implemented by 
checking if the rank of v is less than '"(^"Z*) _ jf v j s successful, then find the lexicographically 
smallest super-node v' G h(v) to which v is not already connected. If none exist, then do nothing; 
otherwise, connect v to v' . 
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We also want the subgraph induced by Vq to be fc-connected to help ensure that Gk is Re- 
connected (Lemma l7.4p . Globally, from each i £ Vq we add an edge to i+2, ... ,i+ \k/2] , taken 
modulo cn. This ensures that the subgraph induced by Vq is fc-connected (Lemma IT.3j) . Locally, 
this is implemented as follows. If € [cn] 2 is queried, if the edge is already in Gk-i, then the 
local reconstructor returns 1, otherwise it returns 1 if and only if j £ {i + l,i + 2, . . . , i + \k/2] 
(mod cn)} or i £ {j + l,j + 2, . . . ,j + [A?/2] (mod cn)}. 

7.3.3 Pseudocode for fc-connectivity given (A; — l)-connectivity 

The pseudocode for the fc-connectivity reconstructor is given below. It assumes the input graph 
Gk-i is (k — l)-connected. 



procedure /c-Conn(ui, v 2 ) 

if (k — 1)-Conn(?;i, V2) = 1 then > if edge is already in graph, it stays in the graph 

return 1 
else 

if v\ Vq and v 2 $ Vq then > do not add edge if neither endpoint is in Vq 

return 
else if Ui,t>2 £ then 

if v\ £ {V2 ± 1, . . . , «2 i f A; / 2~| (mod cn)} then 

return 1 
else 

return 
end if 
else 

Let v £ {vi,v 2 } \ Vo 

Let t/ £ {v 1, ^2} \ {w} > Decide if u should be connected to v' £ Vq 

b 4— false > connect v if 6 is true 

for i = l,...,t 2 log(Cn) do 

if ExtremeSetSearch^ , k — 1) succeeds then 
U <— ExtremeSetSearch(-u, k — 1) 
if v has minimal rank in J7 then 

b «— TRUE 
end if 
Break 
end if 
end for 

if b is false and r(v) < ln ( Cn /*) then 

6 <- TRUE 
end if 

if b is true and ?/ is lexicographically smallest in h(v) \ Ng^^v) then 

return 1 
else 

return 
end if 
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end if 
end if 
end procedure 



7.3.4 Analysis 

The following lemma ensures the subgraph induced by Vq after adding these edges is ^-connected. 

Lemma 7.3. For n>2k + l, let V = {0, 1, . . . , n — 1}, n > 2k + 1, and E = {(i, i + j (mod n)) \ 
i 6 V, 1 < j < k}. Then G = (V,E) is 2k-connected. 

Proof. We claim it suffices to show that for every i, there are 2k edge-disjoint paths connecting i 
to i + 1 (mod n). Suppose this is true. Consider any cut (C, V \ C) of the vertices. To show that 
this cut has 2k cut edges, it suffices to show that there exist i G C , j G V \C with 2k edge-disjoint 
paths connecting i to j. But there exists i such that 2 G C and 2 + 1 € V \ C. Hence our claim 
implies the assertion. 

It remains to prove our claim. By symmetry, we may assume i = 0. We will exhibit 2k 
explicit edge-disjoint paths from to 1. For j = 2,... ,k, we have paths which traverse edges 
(0, j), + (j + 1, 1) as well as paths which traverse edges (0, n— j), (n—j,n—j + l), (n—j+1, 1). 
This accounts for 2k — 2 paths which are pairwise edge-disjoint. Note that, in these 2k — 2 paths, 
the only edges of the form (a, a + 1) used are for a G {2, . . . , k} U {n — k, . . . , n — 2}, and the only 
edges for the form (6, b + k) used are for b G {0, 1, n — k, n — k + 1}. The final two paths are the 
path which is simply the edge (0, 1), and the path which traverses edges (0, n — 1), (n — 1, n — k — 
1), (n - k - 1, n - k - 2), (n - k - 2, n - k - 3), . . . , (k + 3, A; + 2), (k + 2, 2), (2, 1). □ 

The following lemma ensures that the resulting graph is fe-connected. 

Lemma 7.4. With probability at least 1 — 7, the resulting graph Gt is k-connected. 

Proof. We first show that with probability at least 1 — 7, every extreme set of Gt-i has a vertex 
which adds an edge to Vo- First, consider each small extreme set, i.e. extreme sets of size at most 
t. For each such set U, there is some v\j G U with minimal rank in U. By the choice of C, with 
probability at least 1 — 7/2 every vjj has a successful extreme set search and recognizes that it has 
minimal rank in U, hence every small extreme set contributes an edge to Vo- Next, consider each 
large extreme set, i.e. extreme sets of size more than t. The probability that a large extreme set 

contributes no edges, i.e. has no successful vertices, is at most ^1 — M^W*) ^ < Since there 

are at most n/t large extreme sets, the probability that every large extreme set contributes an edge 
is at least 1 — 7/2 by the choice of C. Hence with probability at least 1 — 7 every extreme set 
contributes an edge to V$. 

Now we assume that every extreme set contributes an edge to Vq and show that Gk is Re- 
connected. It suffices to show that, for every extreme set S in G^-i, one of the edges added by the 
procedure crosses between S and V \ S. We have three cases. 

• Vq C V \ S: Consider v G S of minimal rank. With high probability, the neighborhood U of 
v found by the search algorithm is contained in S, so v is also minimally ranked in U. There 
must exist v* G h(v) to which v is not already connected, for if not, then v is connected to 
\h(v)\ > k vertices outside of S, contradicting that S is (k — l)-extreme. 
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• Vo C S: We claim there is another extreme set S' ^ S in Suppose not. Then for 
every subset S" Q V \ S, deg(S") > k, for otherwise V \ S contains an extreme set. But 
deg(V \ S) = deg(S) = k — 1, so V \ S is an extreme set, a contradiction. Now, recall that 
S' is disjoint from S. By the same argument as in the previous case, with high probability 
there is v 6 S' which gets connected to some super-node in Vo Q S. 

• Vo fl S and Vo \ S are both non-empty: We claim that in G^, S has degree deg(S') > k. 
Consider the cut (Vo fl S, Vo \ S) within the subgraph Vo- Any cut edge here must be a 
cut edge in the cut (S,V \ S). By Lemma E2J the subgraph Vo is fe-connected, so the cut 
(Vo H S,Vo\S), and hence the cut (S, V \S), has at least k cut edges. 

□ 

The following lemma shows that with high probability not too many additional edges are added. 
Lemma 7.5. With probability at least 1 — 5, the number of edges added is at most 

n\n(Cn/t) 



2em + ckn/2 + 



St 



Proof. Each t-bounded extreme set will contribute at most one edge. There are at most 2em 
extreme sets. On average 1/t of the remaining vertices contribute edges. Finally, the super-nodes 
themselves contribute at most ckn/2 edges. Therefore we add at most 2em + ckn/2 + X edges, 
where X is a random variable with mean K[X] < raln (^' w / t ) _ Markov's inequality implies 



Pr 



X > 



i 

E[X] 



< 5. 



□ 



Proof of Theorem \3.4\ Set C = i n (i/(\_ 7 )) anci t = • By Lemma 17.41 our resulting graph is 

fc-connected with probability at least 1 — 7 and by Lemma [7. 51 dist(Gjt_x, Gfe) < (2+a)e+ck/2 with 
probability 1 — 5 and therefore by induction dist(G, G&) < (2 + a)ke + ck 2 /2. This can be improved 
to (2 + a)ke + ck/2 by noting that the same super- nodes and the same edges between them can be 
used for all intermediate graphs G\,... ,Gk-i- The success probability is at least (1 — 5) k (1 — j) k > 
1 — k(5 + 7). The query complexity for correcting G^-i to G/. is 0(t 3 (t + k) \og(t + k) log(Cn)) 
queries to Nc k _ 1 - But each call to Nc k _ 1 takes O ((- + l) k) calls to Ec k _ 1 - By induction, the 

query complexity of E Gk is O (((\ + l) kf ■ t 3k (t + k) k \og k (t + k) \og k (Cn)) . □ 



8 Local reconstruction of small diameter 

In this section we prove Theorem 13.51 We first go over some preliminaries on graph diameter. We 
then give the high level description of the algorithm and prove some useful characteristics of graphs 
that are close to having small diameter before finally presenting the implementation and analysis 
of the algorithm. 
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8.1 Preliminaries 

Definition 8.1. Let G be a graph with adjacency matrix A. For an integer k, let G k be the graph 
on the same vertex set as G, with adjacency matrix A k (boolean arithmetic). 

It is not hard to show that there is an edge between u and v in G k if and only if there is a path 
of length at most k between u and v in G. This observation immediately gives us the following. 

Proposition 8.2. Let G be a graph and let D > be an integer. Then diamG < D if and only if 
G D is a complete graph. 

8.2 High level description 

The basic idea behind the algorithm is as follows. Again, we designate a "super-node" vq and add 
edges between vq and a few special vertices. If the input graph is close to having diameter at most 
D, then we aim for our reconstructed graph to have diameter at most 2D + 2. We show that if G is 
close to having diameter at most D, then we have an upper bound on the size of any independent 
set in G D (Lemma 18.31 and Corollary 18.4ft . 

Ideally, then, we want our special vertices (those that get an edge to vo) to be a dominating set 
in G D that is not too large. If we add edges from the dominating set in G D to Vo, then our resulting 
graph G has diameter at most 2D + 2, since any vertex can reach some vertex in the dominating set 
within D steps, and hence vq within D + 1 steps. If this dominating set is a maximal independent 
set, then our upper bound on the size of independent sets in G D also upper bounds the number 
of edges we add. However, all known algorithms for locally computing a maximal independent set 
have query complexity bounded in terms of the maximum degree of the graph. A variant of the 
algorithm found in p3] has been analyzed by [22] and [16] to run in expected time bounded in 
terms of the average degree of the graph, but this average is taken over not only coin tosses of the 
algorithm but also all possible queries. This is undesirable for us since we want a uniform bound 
on the query complexity for any potential query vertex, given a "good" set of coin tosses. We want 
an algorithm such that, for most sets of coin tosses, we get the correct answer everywhere, whereas 
the algorithm of p3] leaves open the possibility of failure for some queries, regardless of the coin 
tosses. If some queries give the wrong answer, then the fact that our reconstructed graph retains 
the property is compromised. Instead, we do something less optimal but good enough. Note that 
adding edges does not increase diameter. Instead of using a maximal independent set, we settle for 
a dominating set in G D as long as we can still control its size. 

8.3 Properties of graphs close to having small diameter 

The following lemma and subsequent corollary state that if a graph G is close to having diameter 
at most D, then no independent set in G D can be very large. 

Lemma 8.3. Let v\,...,v^ be an independent set in G D and let H be the subgraph of G D induced 
by this set. Let (s,t) be an edge not in E(G) and let G' = (V{G),E(G) U {(s,t)}) be the graph 
obtained by adding (s,t) to G. Let H' be the subgraph of (G') D induced by V\,...,V}-. Then, 
for some i € {1, . . . ,k}, all edges in H' (if any) are incident to Vi. In particular, if G D has an 
independent set of size k, then {G') D has an independent set of size k — 1. 
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Proof. We will prove the assertion as follows. First, we will show that any two edges in H' must 
share a vertex. Then, we show that any third edge must also be incident to that vertex. This 
implies that all edges share a common vertex. 

Suppose that the edges (vi, Vj) and (vk, vi) are in H' . Then without loss of generality the paths 
from Vi to Vj and from Vk to vi in G' must use (s, t). Let a = d(vi, s), b = d(t, Vj), c = d(vk, s) and 
d = d(t, vi), so we have 

a + b + l <D 
c + d + l < D. 

Adding yields a + b + c + d < 2D — 2, so at least one of (a + c), (b + d) must be less than or equal 
to D. Without loss of generality, suppose a + c < D. This implies that d(vi,Vk) < D even in G, so 
it must be that V{ = Vk- 

Now suppose yet another edge is in H'. By what we just showed, it must share an incident 
vertex with (vi, Vj), and it must do likewise with (v i, v{). So either it is incident to Vi, in which case 
we are done, or the edge is (vj, v{). We will show the latter cannot happen. Define a, b, d as above. 
Note that we have 

a + d+ 1 < D. 

Suppose there is path of length < D from Vj to v\ in G' which traverses (s, t) by entering via s and 
exiting via t. Let e = d(vj, s). Then we have 

e + d + l < D. 

However, since any path from Vi to Vj that does not traverse (s, t) must have length greater than 
D, we have a + e > D from which we can deduce 

e > b + 1. 

Similarly, since any path from Vj to v\ that does not traverse (s, t) must have length greater than 
D, we have 

b + d > D. 

But then D>e + d+ l>b + d + 2>D + 2, a contradiction. □ 

Corollary 8.4. Suppose G is e-close to having diameter < D. Then any independent set in G D 
has size at most em + 1 . 

Proof. Suppose G D has an independent set of size em + 2. There exist a set of em edges such that, 
when added to G, we obtain G' with diam(G') < D, and hence (G') D is a complete graph. But, 
by Lemma |8.3| (G') D has an independent set of size 2, a contradiction. □ 

Finally, we will use a result from [2] which we will restate in our own terms below: 

Theorem 8.5 (O Theorem 3.1]). Any connected graph G is {jy^)-close to having diameter < D. 

This result implies that any connected graph is e-close to having diameter at most — , and will 
come in handy when we want to bound our query complexity. In particular, the query complexity 
of our algorithm will depend on D, but if D > which is O(l) for constant e, then we can instead 
simply aim for diameter and bound our query complexity in terms of this constant instead. 
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8.4 Algorithm 

We start by fixing a super-node vq E G. Given our discussion in the high level description, it 
remains to implement the selection of a small dominating set. To this end, we create a dominating 
set S by first adding the set H of high-degree vertices into S and then using the local maximal 
independent set algorithm found in [19] , which is based on Luby's algorithm ([H]), on G D with H 
and its vertices' neighbors (in G D ) removed to create an independent set M. Then let S = HUM. 

By high-degree vertex we mean a vertex with degree greater than d/e, of which there are at 
most en, where d = — is a bound on the average degree. We will also consider the super-node 
Vq a high-degree vertex for our purposes. To implement choosing a maximal independent set, we 
define a subroutine MISo(u) as follows. On input v, simulate the local maximal independent set 
described in [19] , except explore all neighbors within D steps from v rather than just immediate 
neighbors, and automatically reject if the algorithm encounters any vertex with degree exceeding 
d/e. Note that the running time of the local MIS algorithm given in [19] is bounded in terms of the 
maximum degree of the graph. This is not problematic since our variant of the algorithm ignores 
any vertex with degree greater than d/e, hence the effective maximum degree of our graph G for 
the purposes of the algorithm is d/e, so the effective maximum degree of G D is (d/e) D . 

One final challenge is that if D is large, say O(logn), then our query complexity bound in terms 
of our effective degree is no longer sublinear. We work around this by using a result of [2], which 
states that every connected graph is e-close to having diameter at most ^ = 0(l/e). Therefore, we 
can aim for achieving diameter K = min{D, — } so that our effective degree is (d/e) K = {d/ e)°^ l ^ e \ 
Of course, this only works if our graph is connected to begin with. Therefore, we use the neighbor 
oracle for the connected correction G' of G, given in Section [6l The idea is then to first make G 
into a connected graph G', and then reconstruct a small diameter graph G out of G'. 



procedure SmallDiam^i, V2, D) 




K <- min|-^,m 

L em ' J 




if Eg*{vi, V2) = 1 then 


> if edge is already in graph, it stays in the graph 


return 1 




else 




if vo £ {^1,^2} then 


t> do not add edge if neither endpoint is 


return 




else 




Let v G {vi,v 2 } \ {v } 


> check if v is special 


if degQ>(v) > d/e then 




return 1 




else 




return MIS^O) 


> neighbor queries to G' instead of G 


end if 




end if 




end if 




end procedure 





Proof of Theorem W7b\ Correcting G to G' with Mod-Connect adds (1 + a)em + cm edges. The 
query complexity is clearly dominated by that of MIS^(v). The probability of success follows from 



18 



that of the local MIS algorithm. Now we show correctness. Let H be the set of vertices in G with 
degree greater than d/e, and let M be the set of vertices for which MIS(u) = 1. Observe that 
M is an independent set in G D , so \M\ < em. Note that since we forced vq to be in this union, 
the number of edges we added is actually \H U M \ {vq}\ = \H\ + \M\ — 1 < en + em < 2em. 
Furthermore, let v £ V(G). There exists some u E H U M such that d(v,u) < K, otherwise v 
would have been added to M. Therefore, d(v , vq) < K + 1. For any other v' € V(G), we thus have 
d{v, v 1 ) < d{v, v ) + d{v ,v') < 2K + 2 < 2D + 2. □ 

References 

[1] N. Ailon, B. Chazelle, S. Comandur, D. Liu. Property-preserving data reconstruction. In Proc. 
15th International Symposium on Algorithms and Computation, pages 16-27, 2004. 

[2] N. Alon, A. Gyarfas, and M. Ruszinko. Decreasing the diameter of bounded degree graphs. 
J. Graph Theory 35 (2000), 161-172. 

[3] M. A. Bender and D. Ron. Testing properties of directed graphs: acyclicity and connectivity. 
Random Structures and Algorithms 20 (2002), 184-205. 

[4] A. Bhattacharyya, E. Grigorescu, M. Jha, K. Jung, S. Raskhodnikova, D. Woodruff. Lower 
bounds for local monotonicity reconstruction from transitive-closure spanners. In Proc. lJ^th 
International Workshop on Randomization and Computation, pages 448-461, 2010. 

[5] Z. Brakerski. Local property restoring. Manuscript, 2008. 

[6] B. Chazelle, C. Seshadhri. Online geometric reconstruction. In Proc. 22nd ACM Symposium 
on Computational Geometry, pages 386-394, 2006. 

[7] A. Derbakova, N. Correll, D. Rus. Decentralized self-repair to maintain connectivity and cov- 
erage in networked multi-robot systems. In Proc. IEEE International Conference on Robotics 
and Automation, 2011. 

[8] O. Goldreich and D. Ron. Property testing in bounded degree graphs. Algorithmica 32 (2002), 
302-343. 

[9] M. Jha and S. Raskhodnikova. Testing and reconstruction of Lipschitz functions with appli- 
cations to data privacy. In Proc. 52nd Annual IEEE Symposium on Foundations of Computer 
Science, pages 433-442. 

[10] S. Kale, Y. Peres, C. Seshadhri. Noise tolerance of expanders and sublinear expander recon- 
struction. In Proc. 49th Annual IEEE Symposium on Foundations of Computer Science, pages 
719-728, 2008. 

[11] M. Luby. A simple parallel algorithm for the maximal independent set problem. SI AM Journal 
on Computing, 15(4):1036-1053, 1986. 

[12] S. Marko, D. Ron. Approximating the distance to properties in bounded-degree and general 
sparse graphs. ACM Transactions on Algorithms 5(2) (2009). 

[13] K. Menger. Zur allgemeinen Kurventheorie. Fund. Math. 10: 96-115, 1927. 



19 



[14] H. N. Nguyen and K. Onak. Constant-time approximation algorithms via local improvements. 
In Proc. 49th Annual IEEE Symposium on Foundations of Computer Science, pages 327-336, 
2008. 

[15] D. Noar, D. Gusfield, C. Martel. A fast algorithm for optimally increasing the edge connectivity. 
SICOMP, 26(4):1139-1165, 1997. 

[16] K. Onak, D. Ron, M. Rosen, R. Rubinfeld. A near-optimal sublinear-time algorithm for ap- 
proximating the minimum vertex cover size. In Proc. 23rd Annual ACM-SIAM Symposium on 
Discrete Algorithms, pages 1123-1131, 2012. 

[17] M. Parnas and D. Ron. Testing the diameter of graphs. Random Structures and Algorithms 
20 (2002), 165-183. 

[18] M. Parnas, D. Ron, R. Rubinfeld. Tolerant property testing and distance approximation. 
Journal of Computer and System Sciences 72 (2006), 1012-1042. 

[19] R. Rubinfeld, G. Tamir, S. Vardi, and N. Xie. Fast local computation algorithms. In Proc. 2nd 
Symposium on Innovations in Computer Science, pages 223-238, 2011. 

[20] M. E. Saks and C. Seshadhri. Local monotonicity reconstruction. SIAM Journal on Computing 
39 (2010), 2897-2926. 

[21] E. Stump, A. Jadbabaie, V. Kumar. Connectivity management in mobile robot teams. In Proc. 
IEEE International Conference on Robotics and Automation, 2008. 

[22] Y. Yoshida, M. Yamamoto, H. Ito. An improved constant-time approximation algorithm for 
maximum matchings. In Proc. J^lst ACM Symposium on Theory of Computing, pages 225-234, 
2009. 



20 



